Mining of Users’ Access Behaviour for Frequent Sequential Pattern from Web Logs

نویسنده

  • S. Vijayalakshmi
چکیده

Sequential Pattern mining is the process of applying data mining techniques to a sequential database for the purposes of discovering the correlation relationships that exist among an ordered list of events. The task of discovering frequent sequences is challenging, because the algorithm needs to process a combinatorially explosive number of possible sequences. Discovering hidden information from Web log data is called Web usage mining. One common usage in web applications is the mining of users’ access behaviour for the purpose of predicting and hence pre-fetching the web pages that the user is likely to visit. The aim of discovering frequent Sequential patterns in Web log data is to obtain information about the access behaviour of the users. Finding Frequent Sequential Pattern (FSP) is an important problem in web usage mining. In this paper, we explore a new frequent sequence pattern technique called AWAPT (Adaptive Web Access Pattern Tree), for FSP mining. An AWAPT combines Suffix tree and Prefix tree for efficient storage of all the sequences that contain a given item. It eliminates recursive reconstruction of intermediate WAP tree during the mining by assigning the binary codes to each node in the WAP Tree. Web access pattern tree (WAP-tree) mining is a sequential pattern mining technique for web log access sequences, which first stores the original web access sequence database(WASD) on a prefix tree, similar to the frequent pattern tree (FP-tree) for storing non-sequential data. WAP-tree algorithm then, mines the frequent sequences from the WAP-tree by recursively re-constructing intermediate trees, starting with suffix sequences and ending with prefix sequences. An attempt has been made to AWAPT approach for improving efficiency. AWAPT totally eliminates the need to engage in numerous reconstructions of intermediate WAP-trees during mining and considerably reduces execution time.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Performance Improvements and Efficient Approach for Mining Periodic Sequential Access Patterns

Surfing the Web has become an important daily activity for many users. Discovering and understanding web users’ surfing behavior are essential for the development of successful web monitoring and recommendation systems. To capture users’ web access behavior, one promising approach is web usage mining which discovers interesting and frequent user access patterns from web usage logs. Web usage mi...

متن کامل

Knowledge Discovery from Web Usage Data: Research and Development of Web Access Pattern Tree Based Sequential Pattern Mining Techniques: A Survey

Sequential pattern mining is the process of applying data mining techniques to a sequential database, to extract frequent subsequences to discover correlation that exists among the ordered list of events. Web Usage mining (WUM) discovers and extracts interesting knowledge/patterns from Web logs is one of the applications of Sequential Pattern Mining. In this paper, we present a survey of the se...

متن کامل

Mining Constraint-based Multidimensional Frequent Sequential Pattern in Web Logs

In this paper we introduce an efficient strategy for discovering Web usage mining is the application of data mining techniques to discover usage patterns from Web data, in order to understand and better serve the needs of Web-based applications. Web usage mining consists of three phases, namely preprocessing, pattern discovery, and pattern analysis. This paper describes each of these phases in ...

متن کامل

Binary Coded Web Access Pattern Tree in Education Domain

Web Access Pattern (WAP), which is the sequence of accesses pursued by users frequently, is a kind of interesting and useful knowledge in practice. Sequential Pattern mining is the process of applying data mining techniques to a sequential database for the purposes of discovering the correlation relationships that exist among an ordered list of events. WAP tree mining is a sequential pattern mi...

متن کامل

Effective web log mining and online navigational pattern prediction

The web has become the world's largest repository of knowledge. Web usage mining is the process of discovering knowledge from the interactions generated by the user in the form of access logs, cookies, and user sessions data. Web Mining consists of three different categories, namely Web Content Mining, Web Structure Mining, and Web Usage Mining (is the process of discovering knowledge from the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010